CDS

Accession Number TCMCG024C01629
gbkey CDS
Protein Id XP_021970711.1
Location complement(join(58386847..58386984,58387066..58387116,58387202..58387295,58387398..58387519,58387628..58387723,58387855..58388064,58388199..58388273,58388395..58388530,58388607..58388652,58389224..58389512))
Gene LOC110865695
GeneID 110865695
Organism Helianthus annuus

Protein

Length 418aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022115019.2
Definition choline monooxygenase, chloroplastic [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category P
Description choline monooxygenase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R07409        [VIEW IN KEGG]
KEGG_rclass RC00087        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K00499        [VIEW IN KEGG]
EC 1.14.15.7        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00260        [VIEW IN KEGG]
map00260        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGTCAACGATAATGATGATCACGATCAACCCCACTAATCTTCAATTCTCTAACCTGAACAAAACTAGAAACCCATATCCAAATTCACACCTACAACACACCCGGTCACTCAAATCATCACAAATTAGCAACACCCATGAACCCAAATCATCTCATTCACTGGTCCAACAATTCAACCCAAATATCCCAATTCAAGAAGCCCTAACCCCACCCAGTTCTTGGTACACTTCCCCTGAGTTTCTCTCTCTAGAATTTGATCAAGTATTCTTCAAAGGATGGCAGGCTGTTGGATGTACTGCACAAGTTCAGGAGGCCAACAGTTTCTTTACTGGAAGATTAGGAAACATAGAATATGTGGTGTGTCGTGACGAAAATGGCGAGTTGCGTGCGTTTCATAATGTTTGTCGCCACCATGCCTCACTTCTAGCATTTGGAAGTGGAAAAGGAACTTGCTTTACATGCCCTTATCATGGATGGACATACGGGTTGAATGGAGCACTTCTGAAAGCAACCAGAATAACAGGGATGAAGAACTTTAATGTCAAAGAGTTCGGACTCGTTCCATTGAGCGTGGCCATTTGGGGGCCATTTATCCTTCTCAATATGGAAAAAGAGGTGTTTTCCCAACAAGGTTGCGATGATGATGTCGGAATGGAATGGCTAGGTAGCTCTTCCGAGATATTAAGCACCAATGGGGTCGATAATTCTCTAAGTTATCTTTGCAGACGCGAATATACTATCGAGTGCAATTGGAAGGTGTTTTGTGACAATTACTTAGATGGCGGGTATCATGTACCTTTCGCGCATAAAGATCTTGCATCAGGTCTTAAGCTCGACTCGTATTCCACCACAGTGTATGAGAAAGTGAGCATACAAAGATGTGATGGAGGTGAAGTGCAGGGTCAAGAAGATTTTGATAGGCTTGGATCTAAATCCTTATATGCCTTTATTTATCCAAACTTCATGGTGAATAGGTACGGGCCATGGATGGACACCAATCTAGTACTTCCATTAGGACCCCGACGATGCAAAGTGATTTTTGATTACTTCCTTGATGCGTCTCTTAAGGATGATGAAGCTTTTGTGGCTAAGAGTTTAGAAGACAGTGAGAAAGTTCAGATGGAAGATATCATGCTGTGTGAATCGGTTCAGAGAGGGTTGGAATCGCCAGCTTATGACAGCGGCCGATATGCGCCTATGGTAGAGAAGGCCATGCATCATTTTCATTGTCTGCTACATCAAAATCTCATCAAATGA
Protein:  
MSTIMMITINPTNLQFSNLNKTRNPYPNSHLQHTRSLKSSQISNTHEPKSSHSLVQQFNPNIPIQEALTPPSSWYTSPEFLSLEFDQVFFKGWQAVGCTAQVQEANSFFTGRLGNIEYVVCRDENGELRAFHNVCRHHASLLAFGSGKGTCFTCPYHGWTYGLNGALLKATRITGMKNFNVKEFGLVPLSVAIWGPFILLNMEKEVFSQQGCDDDVGMEWLGSSSEILSTNGVDNSLSYLCRREYTIECNWKVFCDNYLDGGYHVPFAHKDLASGLKLDSYSTTVYEKVSIQRCDGGEVQGQEDFDRLGSKSLYAFIYPNFMVNRYGPWMDTNLVLPLGPRRCKVIFDYFLDASLKDDEAFVAKSLEDSEKVQMEDIMLCESVQRGLESPAYDSGRYAPMVEKAMHHFHCLLHQNLIK